
1 10 I . IIOFI M \\\-I..\ ROCV!- AG 



120 PROCESS FOR Till-: M^n'UFACTI RE OF CAROTENOIDS AND 
BIOLOGICALLY USEFUL MATERIALS THEREOF 

130 SOD 

•140 
141 

150 EP99123821.3 

•151 1999-12-01 

160 17 

• 1"0 Patcntln Vcr. 2.0 

210 1 

211 3632 

212 DNA 

213 Phaffia rhodozyma 
220 

221 5'UTR 

222 (922).. (923) 

223 EXPERIMENTAL 

220 

221 exon 

222 (9X6).. (1096) 

220 

221 intron 

'222 (1097). .(1306) 

220 

221 exon 

■ 222 ( 1307). .(1456) 

220 

221 intron 

222 (1457). .(1555) 




220 ~ 

221 cxon 

222 (155())..( 1589) 

220 • 

221 j intron 
222>(1590)..(1694) 

220> 

221 • exon 

222: (1695)..(1799) 

220 

221 • intron 

222 (180())..(1920) 

220 

221 cxon 

222 (1921). .(1982) 

220 

221 intron 

222 (1983)..(2076) 

• 220 • 

221 cxon 

222 -(2077U2140) 

• 220 • 

221 intron 

222 (2141). .(2246) 

22u- 

22 1 ■ cxon 

• 222 -(2247). .(2272) 

• 220 > 

221 > intron 

222 - (2273). .(2390) 

22t) > 

22 1 > cxon 

222> (2391 )..(2507) 



• 221 polyA site 

- 222 - (2663).. (2664) 
-.-223 > EXPERIMENTAL 

• 400 1 

tcctgttgat aatctttcta acgccttgta ctttgaccaa ggcgtttgtc cgaaaUUg 60 
caaacttagt gttggtcgca tggaeggtet teggatccag aactgaegge tegecaataa 120 
agtatgacga tggtagaggt gaaggaggga accacaggtt gaccagtctc aaagagtgct 1 80 
gatgtgcgcg aggatttgtc attaaatggt gltgtatatg etagagecaa gagaagacat 240 
ttggttttgg ttttggtttt gcatttgatg agatgtgtca cgattgaaga egggaggagg 300 
ctcactaacc caagaageca ggatcaggag gaatgcctcc cccttttcat caagatcttt 360 
ctcacatcga acatttgaca ttctctttag tatccttcta tccttttctt ccaacttctc 420 
ccattgtatc gaetttgetc gaettgetet tcttatctct gagcagagat gggcattcca 480 
atatcgaagg agegacacaa gaccttggag tttgggtaac agatgaagag gggccgaggt 540 
ggatggggct gtaggaagta gctgatcgat gagttcctgg atgatgatag gcgaaggaac 600 
agacatagga tctctgtctc gtcctggaat tactgagtct tgtatccagc gtgttcttgt 660 
ctcgaagaag ccttcaagat cgatgtaaga taagacaggc aatgaggacg gacgaacgaa 720 
cgaacgaaaa gaacagaaga gctggtaagt cagtcagtca gtcagtcagt caatcaaaca 780 
ctggtgtcta gggttatagc tcgacgcgac gcgacgcgtt tgagacgega tatgettacg 840 
taatacctgg cgtcatcccc ccagccgagg caagagcega gccgctcgtg aacgacaaaa 900 
ttcaaaaggc tttctccatc ttaagctcat tctcatctaa ccgactcatc tcgttcccat 960 
cattcccatc attctaccgc catcc atg tct git cga gca tec etc tct tec 1012 
gtg tct aga cag act ttc gtc get cct gel get ttc eag ate agg gea 1 060 
aag cat acc ctg cct gag ctt cct tac get tac gat gtaagacttt 1 1 06 
tccgtgttct cctattcgtc gctttcttgg tttttttcgt cttcgccctc tagctcttct 1 166 




tcgtcctttc tglcclgclc Ittgtlgttg alatlcagct cgalagacta acccatctca 1226 
tctcctggac attcttttac tggaaacgta tcttgtcctt ggtttttctt ggctttggtt 1286 
gaaaattcct ctccactcag gcc ctg gag ccc tec ate tec aag gag ate atg 1339 
acc ctt cac cae ace aag cac cat cag act tat gtt aac ggc etc aae 1 387 
get gcc gag gag age lac teg gcc get gtg ggc aag gag gat gig ctt 1435 
acc cag gtt aag ctt cag tct gtacgtctga ccgttttttt ategacegga 1 486 
acgcctggtg aggagggaga tgaagtttga tgagegctea tegtctagea cgttgacccg 1546 
atcatacag get etc aag ttc aac gga gga gga cac ate aat e 1 589 

gtcagtgata ttcttcaaac tettgetgag caagtcaggt caagctgact gtttcgcttt 1649 
gtttctgegg atctatctca tccttgattt ggcatgatga aacag ac tct ctg ttc 1 705 
tgg aag aac ttg get ccc tat gga tec gag gag get acc etc tct gaa 1753 
gga cct etc aag aag get ate gag gaa tct ttt ggt tct ttc gag g 1 799 
gtccgtccat ctatcttcct attcagttgt gtttggttcc ggtatactca tctgttttgt 1859 
ttccccacaa aataaaaata aaaatcttgt cctctccggg ggttcgactg caegttcata 1919 
g cc ttc aag aag aag ttc aac get gac acc get get gtc caa gga tec 1967 
gga tgg ggc tgg ctt gtatgtatca tatcctttcc atctcaaact cttctcagag 2022 
tctttttcct tgagacttca aactgactat acatgtttct acaacaaaca acag ggc 2079 
ttg aac ccg ctt act aag aag ctg gaa gtc acc acg acc gcc aac cag 2 1 27 
gac cct ctg ctt a gtaagttgtt tctacatgat tttctatctc aacgegatet 2 1 80 
i'cateattcn tcactgattc act^attct cttgtttcgt ttttctcggg atgatttcat 2240 
aaacag ct cac att cct ate ate gga gtt gac gtgegtatet ttcttgaata 2292 
<*t(Ma"cut ctuatctcut tttattuact uacutgttgc ttctgtccaa atcattaaaa 2352 
aaaatgaaaa caaataatcg attgaccgac gaaaacag ate tgg gag cae get ttc 2408 



lac ctt cag tac aag aac gtc aag cct gac tat etc get get gtt tgg 2456 
tec gtt ate aac tac aag gag gea gag gee cga ttg cag get get etc 2504 
taa gcgggacgaa aagtaacgac atatgaaggg aggatcaaat ategtttett 2557 
cataaacaac tttcgaggca gatgggagag tacgtacaag agaggtttgt atggagaatt 2617 
gagtttgttg aeggttagea ggttatgata tatgtagcta tagtctagtc taaatctgaa 2677 
agaagagaac aagatggttt gtccgaagag attgagagat caagcccggt catctgatgt 2737 
cgaacaaaca tgccctggtc tgccaacagt ttctagcaca ttatgaccat gttcatgtgt 2797 
aaattgggaa atgagecaga aaggtttatt atctaattca ttgattcatg cgactatgga 2857 
tacatatggg atttccagaa caaacagatg caacaaagca eggcatttte caaagatcga 291 7 
gtcctcccac aagtatgegg caaggtttgt tgttaagaga lataaaagca gacgacaaaa 2977 
caaatcgttt atcgaccctg tgcaccaaca ccgtgaccgt ttgacgagtt ggtagagttg 3037 
tagttgttgc tgttcaaagg agctccagac tggaegctte caagcttcaa caacttctcg 3097 
gcagcgtcgc tgttcgggaa aagaaaaggc aaaaaggaac agagegataa gcatatgtga 3157 
ttctctactt cttataggct cttagctcaa gtcaactcac atgtctttgg eggtaccgaa 3217 
gaegttctea agetgetget tggaagcttt tccgagcttg ccagtaggtc cctggttgga 3277 
gaagaagatg tcgaaggcta agggegatga aaagcatgaa gatattagct ateggegega 3337 
taaaagtgtg acgagatgaa aatggagaaa agatgattcg caccatcgae gacctcgacc 3397 
aaaggaatgg aggtgtcacc ggccttccac ttcttgtact cctcaacgtt gacgaagatg 3457 
acgaagcagt cggtggcctt agcctcgggc tegtagatge tgatgaaaca caataggtag 351 7 
taggagagga gaaagagaag atgatgagat gtcaggatgc ttgettcact gtagatggag 3577 
gaagaagata tgcgaagcaa gacatacact ttggaaagag cttgaaccal tgtag 3632 

• 210 -2 
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221 > ex on 

• 222 > (2525).. (2542) 

-220- 
22 1 poly A site 

• 222 (2607). .(2608) 

< 223 > EXPERIMENTAL 

• 400 • 2 

cttatccttc tgccgctggt ctctgtctgt cgagtgtgtg ggatggtttt ggatatgttc 60 
ctatacgaaa ggtagcgcag agcaaagctg acagtattaa gcaagacaag agcttctttc 120 
tgttgacaga tgaaaggacg aactatgaag ctgtccatgc tccccaaacc gattgacaca 1 80 
ccgccgtcag geaaegcaga atttctcact gettcgaegt cacaccaaca tcgatcctcc 240 
atacctaaaa gcagatcgag acacattgtt ggtcgccatg Uggatggat gtacatcaaa 300 
cccacagcat atatcactca catgtgagaa ctccgtagcc tctaccttct tgtctctcaa 360 
tctgaatgtc tegttgagag gtggaatgaa tgtttacagt ttgagaagac gaaagaaaga 420 
aagagaagag aagagaggaa taegtacgae gaagttatca tcgtatggga acttttctaa 480 
aaaactgect atagtagaga cgatctctgg aggaaagctc tgtagtatga tagtgaagag 540 
cgagcaagtc tgggcaagtg catccttcgt ctacaagaaa gagaccagga aatgaaggag 600 
agacgagtaa gcaggtacct accgatattg gategttetc tctacccagc gatgecttea 660 
ccacgcgttc tatctcttct tgggatggca gatacatact taacgagagc aatctgatgt 720 
ataccgaact tegaaeggaa tgatcccaga atcctcttga acccttgaac ccttgaaccc 780 
tggaaccaag taccaaccga gcaacacgcc gataeggtec acaccacaga accacacgcc 840 
ctegtcatta aaggtgggac gcgccgatgc tggttacgtt cggcccaalc eggaagttae 900 
eggcttggae gtgcctgtaa ccatgccctg aeggtattte gccttcagct aactccatct 960 
catctttttc ctttactacc acaacccacc cttgaacctt cttccccggc ttttttacta 1020 
tatccatcta tcaatcatc atg get cet tac act ctt ccc gac gtaagcttaa 1073 




agtttgagct gtgtgtgctt atctcaatet tggagttgaa ctcaccgttt tttgUtttg 1 133 
cttcctggtt tttttatcgg catccctcct ttttttcccc tcgtggtcgc atatgatttg 1 193 
ctcatcaatc ggcgtttccc atgcatcttt gtcatccgtt tcag ctl cct tac get 1249 
tac gat gec ttg gag cct tac ate tct aag g gtgagattct tagtcagaet 1 300 
gttgttccgg ttcgacacga tagctaatcg tctctcgttc ctcaatatga aeatgcag 1358 
aa ate atg ate ctt cac cac tec aag cac cat cag act tac gte 1402 
gtaegtaate taaaggtcat ctccgtctac atggccggat caacttgctc atagatctte 1462 
cttctgttcg gcgctacgta g acc aac etc aac gec get ate eag get ttc 1513 
tec cag acc aat gac ate aag gec cag ate get ctt cag age get etc 1 561 
aag ttc aac gga gga gga cac ate aac c gtacgatcat tctccctctt 1 609 
ctggcttatc atatgtgttg cttgtcacta acacgcatgc aaccccggga tatctcaccc 1669 
tgtag ac tec etc ttc tgg aag aac atg get cct gee gac tct get gat 1718 
gec aag etc acc gag gga teg etc aag act gee ate gac aag gac ttt 1 766 
gga tec ttc gag gag ttc aag aag aag ttc aac act get act etc ggt 1814 
gtc cag gga tct gtcagtatct cgtttgcttc gacatactct cagctttcct 1 866 
teegtaaact gacgaatagt tttteggaca tgtacttgta g gga tgg gga tgg etc 1 922 
gtgcgtttga cctttttcca ctttgaacat tagegatagt gatacctaac aactgtgaat 1982 
tggaatatag gga tac aac ace get acc aag cac etc gag ate gec acc 2031 
acc gec aac cag gat ccc ctt ate a gtatgtgact tctctcgtgt 2076 
ggtcaccata agccagttgc tgacacattt egttegctgt ctctcgactt cgtag ct 2 1 33 
ttg act ccc ate att ggt ctt gac gttagtaatt ctatctagtg attggagtcg 21 87 
agttctgaac ttgecttgat ctcaaacgaa tgaatcaatt tcttttggta g ate tgg 2244 
uau cac uct ttc tac etc caiz tac aae aat etc aau cct gat tac ctt g 2293 



gtacgtaatt ctctattcgt ttgccccggt ttgatctttg actcactctt caaaatgttt 2353 
tcgtttgta actttgaaaa acag cc get tie tgg aac gtc tgc aac ttt gel 2405 
gag get cag ega agg ttt gat gtgagtacag gcgctacccc tacggaggaa 2456 
ucsaaeut'ja uctyaccact ttttatcttt cUatttgga atgaacgatc cgatgatcaa 25 1 6 
acaaacag get get gtc aag get taa tggtcccatt tatctctttg attcgaegge 2572 
gatgaegget ttctcgcatc cgaagaaggc aaggctatga ttactgttat tetgecatgt 2632 
tgcttgcttt getatgetet atgttctttt cttttgcctc tcttcaaagc caaggegtta 2692 
aggaaggece ttcagtctgt tttacatatg cacatataca tgagaacata tcacggactc 2752 
ggcggctggt ggtcctcttg agegtegget tcaagattag tgtccacacg tgaagcgttc 2812 
ggtgccatcc aacctggtag gaatccccat egggegggaa tccaattatc aattggcggt 2872 
eggecagatt egagcteggg tatctcagaa gcgtcaagcg ggegcattte caggecttta 2932 
agaggagcaa atttaatccg cctgggtgtt cagegagaca cgaacagttt gaaacagagt 2992 
ctgcttgtga gttactegge gagatcactg aggactaaac tttctcagct cgtggacgaa 3052 
aagaacgaac caaaeggtet tccctgtatc tcgaccatct ccttctccat ctcttacaac 3112 
accteggatg aactccaagg cttgctttcc aaagttcaaa caaactccgg gttgccatcc 3172 
acctggtttg tctctaacga gecgagggat atecategtt cggaacgttt gaacagactg 3232 
gatggtaggt ggccggtcgc tteggaagee aatcataatg gtgggaatcg agagaaggaa 3292 
tgattgggcc cagtgtttaa gacttgtgtt tgttggcaga gtaeggaegg aaagtaggac 3352 
agacttaatc aaggegagee aag 3375 
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• 213 • Phaffia rhodozvma 



220 > 

221 ■•- exon 
222>(1)..(34) 

220 > 

221 intron 

222 (35)..( 115) 



220- 



exon 



222: ( 1 1 7) .( 1 59) 
220: ■ 

22 ]■ intron 

222: (160)..(242) 

220: • 

221 exon 

222 (243)..(363) 

220 

221 intron 
■ 222 ■ (364). .(436) 

220 ■ 

221 • exon 

• 222 (437).. (951) 

400 • 3 

tec gga age tea gat ace get cga gat cct cga g gtttgtgtgc 44 
tttcgetttg ttcgcatgga tgaagctgtt aacttaaaaa aatcctcgtg tttctctttg 104 
tttcaacata g gt ttc tct ctt aag gtc aag ace tct gag gga aac tgg 153 
gac ttt gtaegtatte ttatcgactg agtcatcaag ctegttateg ctctcttacc 209 
ctcatccttt tgtgtctctg tctacacctc tag gtc gga aac aac act ccc ate 263 
ttt ttc ttg aga gac cca gee aag ttt ccg ate ttc att cac acc cag 3 1 1 
aag agg aac ccg cag aca aac tct aaa gac aag gac get ttc tgg gac 359 
tac c uttegtata accttgtcac tccctgcgtg ccgctctgat tcatgttgac 4 1 2 



cttgtctttg alalaatttt atag ta tec eaa aac ccc gag tec gtg cat cag 465 
gtg ctg cac ctg ttc agt gat cga gga ace cct get let tac cga cac 5 1 3 
atg cat ggt lac tel gga cac acc ttc aag atg gtc aac agg aac ggt 561 
gac tgg aat tat gtc cag atl cac atg cgc acc gat cag ggt gtc aag 609 
act cac acc aat gaa gag get teg aaa etc gac gee tec aat ccc gat 657 
tea aac gga gac gac ttg ttc gac gca ate aag aat gga gac ttc cct 705 
age tgg acg gtt cag gta cag gta atg tct cct gag cag gec cag aag 753 
ttc aga tac aac att ctg gat etc acc aag gtc tgg tec cac aag gag 80 1 
ttc cca ctt agg acg att gga aag ttc act ttg aac cga aac gtg gat 849 
aac tat ttc gca gag gtt gaa cag etc gec ttt get cct tec cat ctg 897 
cct cct gga ate gag ccc teg aac gat ccc gtc ctt cag get cga eta 945 
ttc tec 95 1 



210: 4 
211: 669 
212: DNA 

213 Phaffia rhodozyma 
220 ■ 

:221 CDS 
•:222 • (1 )..(669) 

:40() • 4 

atg tct gtt cga gca tec etc let tec gtg tct aga cag act ttc gtc 48 
Viet Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phc Val 
1 5 10 15 

get cct get get ttc cag ate agg gca aag cat acc ctg cct gag ctt 96 
Ala Pro^Ala Ala Phe Gin He Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

cct tac get tac gat gee ctg gag ccc tec ate tec aag gag ate atg 144 
Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu lie Viet 



35 40 45 

acc ctt cac cac acc aag cac cat cag act tat gtt aac ggc etc aac 192 
Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn (ily Leu Asn 
50 55 ' (>0 

get gec gag gag age tac leg gee get gtg ggc aag gag gat gtg ctt 240 
Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 

acc cag gtt aag ctt cag tct get etc aag ttc aac gga gga gga cac 288 
Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 
85 ' 90 95 

ate aat cac tct ctg ttc tgg aag aac ttg get ccc tat gga tec gag 336 
He Asn His Scr Leu Phe Tip Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 110 

gag get acc etc tct gaa gga cet etc aag aag get ate gag gaa tct 384 
Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 

ttt ggt tct ttc gag gee ttc aag aag aag ttc aac get gac acc get 432 
Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 135 140 

get gtc caa gga tec gga tgg ggc tgg ctt ggc ttg aac ccg ctt act 480 
Ala Val Gin Gly Ser Gly Trp Gly Trp Leu Gly Leu Asn Pro Leu Thr 
145 150 155 160 

aag aag ctg gaa gtc acc acg acc gee aac cag gac cct ctg ctt act 528 
Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 

cac att cct ate ate gga gtt gac ate tgg gag cac get ttc tac ctt 576 
His He Pro lie lie Gly Val Asp lie Trp Glu His Ala Phe Tyr Leu 
180 185 190 

cag tac aag aac gtc aag cct gac tat etc get get gtt tgg tec gtt 624 
Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Scr Val 
195 200 205 

ate aac tac aag gag gca gag gee cga ttg cag get get etc taa 669 
He Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 " ' 215 220 



■•210- 5 
. -> | ] .. 222 

212 PRT 

• 213 ■ Phaffia rhodozyma 

• 400 • 5 

Met Ser Val Arg Ala Ser Leu Ser Ser Val Scr Arg Gin Thr Phc Val 
1 5 1 ( > 15 

Ala Pro Ala Ala Phe Gin lie Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser lie Ser Lys Glu lie Met 
35 ' 40 45 

Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 

Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 

Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 
85 90 95 

He Asn His Ser Leu Phe Tip Lys Asn Leu Ala Pro Tyr Gly Scr Glu 
100 105 HO 

Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 

Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 ' 135 140 

Ala Val Gin Gly Ser Gly Trp Gly Tip Leu Gly Leu Asn Pro Leu Thr 
145 150 155 100 

Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 

His He Pro lie lie Gly Val Asp He Trp Glu His Ala Phe Tyr Leu 
180 185 190 

Gin Tvr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 



195 200 205 

He Asn Tyr Lys Glu Ala Glu Ala Arg Leu Ciln Ala Ala l eu 
210 " 215 220 



-210-6 
■'21 1' 597 

212 DNA 
<213> Phaffia rhodozyma 

• 220 • 

221 CDS 

222 ■ (1)..(597) 

400 - 6 

atg get cct tac act ctt ccc gac ctt cct tac get tac gat gec tig 48 
Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
1 5 10 15 

gag cct tac ate tct aag gaa ate atg ate ctt cac cac tec aag cac 96 
Glu Pro Tyr He Ser Lys Glu lie Met He Leu His His Ser Lys His 
20 25 30 

cat cag act tac gtc ace aac etc aac gec get ate cag get ttc tec 144 
His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 

cag acc aat gac ate aag gec cag ate get ctt cag age get etc aag 1 92 
Gin Thr Asn Asp lie Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 

ttc aac gga gga gga cac ate aac cac tec etc ttc tgg aag aac atg 240 
Phe Asn Gly Gly Gly His lie Asn His Ser Leu Phe Trp Lys Asn Met 
65 70 75 80 

get cct gec gac tct get gat gee aag etc acc gag gga teg etc aag 288 
Ala Pro Ala Asp Ser Ala Asp Ala Lys Leu Thr Glu Gly Ser Leu Lys 
85 90 95 

act gec ate gac aag gac ttt gga tec ttc gag gag ttc aag aag aag 336 
Thr Ala lie Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 




ttc aac act get act etc ggt gtc cag gga tct gga tgg gga tgg etc 384 
Phc Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp ( ily Tip Leu 
115 120 125 

gga tae aac aec get acc aag eac etc gag ate gee ace ace gee aac 432 
Gly Tvr Asn Thr Ala Thr Lys His Leu Glu lie Ala Thr Thr Ala Asn 
130 135 140 

cag gat ccc ctt ate act ttg act ecc ate att ggt ctt gac ate tgg 480 
Gin Asp Pro Leu lie Thr Leu Thr Pro He He Gly Leu Asp lie Trp 
145 150 155 160 

gag eac get ttc tae etc cag tae aag aat gtc aag cct gat tae ctt 528 
Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 

gee get ttc tgg aac gtc tgc aac ttt get gag get cag cga agg ttt 576 
Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 

gat get get gtc aag get taa 597 
Asp Ala Ala Val Lys Ala 
195 



• 210 ~ 
211 • 198 

• 212 • PRT 

<213 ■ Phaffia rhodozyma 
<400 • 7 

Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
1 5 10 15 

Glu Pro Tyr lie Ser Lys Glu lie Met He Leu His His Ser Lys His 
20 25 30 

His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 

Gin Thr Asn Asp lie Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 

Phe Asn Gly Gly Gly His He Asn His Ser Leu Phe Trp Lys Asn Met 
65 " 70 75 80 



Ala Pro Ala Asp Scr Ala Asp Ala Lys Leu Thr Glu Cily Scr Leu Lys 
85 90 95 

Thr Ala 11c Asp Lys Asp Phc Gly Scr Phc Glu Glu Phc Lys Lys Lys 
100 ' 105 110 

Phc Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp (ily Trp Leu 
115 1 20 1 25 

Gly Tyr Asn Thr Ala Thr Lys His Leu Glu lie Ala Thr Thr Ala Asn 
"l 30 135 140 

Gin Asp Pro Leu He Thr Leu Thr Pro He lie Gly Leu Asp lie Trp 
145 150 155 160 

Glu His Ala Phc Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 

Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 

Asp Ala Ala Val Lys Ala 
195 



210: 8 

21L 714 

212: • DNA 

213 Phaffia rhodozyma 
-.220 

:221 CDS 

:222 (1)..(714) 

<400 • 8 

tec gga age tea gat acc get cga gat cct cga ggt ttc tct ctt aag 48 
Ser Gly Scr Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Scr Leu Lys 
1 5 10 15 



gtc aag acc tct gag gga aac tgg gac ttt gtc gga aac aac act ccc 96 
Val Lys Thr Scr Glu Gly Asn Trp Asp Phc Val Gly Asn Asn Thr Pro 
20 25 30 



ate ttt ttc ttg aga gac oca gec aag ttt ccg ate ttc att cac acc 1 44 
He Phc Phc Leu Arg Asp Pro Ala Lys Phc Pro lie Phe lie His Thr 
35 40 45 

cag aag agg aac ccg cag aca aac let aaa gac aag gac get ttc tgg 1 ( )2 
Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phc Trp 
50 ' " 55 o() 

gac tac eta tec caa aac ccc gag tec gig cat cag gtg ctg cac ctg 240 
Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 

ttc agt gat cga gga acc cct get tct tac cga cac atg cat ggt tac 288 
Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 W 05 

tct gga cac acc ttc aag atg gtc aac agg aac ggt gac tgg aat tat 336 
Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 
100 105 110 

gtc cag att cac atg cgc acc gat cag ggt gtc aag act cac acc aat 384 
Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 12(f 125 

gaa gag get teg aaa etc gac gec tec aat ccc gat tea aac gga gac 432 
Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 

gac ttg ttc gac gca ate aag aat gga gac ttc cct age tgg acg gtt 480 
Asp Leu Phe Asp Ala lie Lys Asn Gly Asp Phe Pro Ser Tip Thr Val 
145 150 155 160 

cag gta cag gta atg tct cct gag cag gec cag aag ttc aga tac aac 528 
Gin Val GlnVal Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 
165 170 175 

att ctg gat etc acc aag gtc tgg tec cac aag gag ttc cca ctt agg 576 
lie LeuAsp Leu Thr Lys Val Trp Ser His Lys Glu Phe Pro Leu Arg 
180 185 190 

acg att gga aag ttc act ttg aac cga aac gtg gat aac tat ttc gca 624 
Thr He Qy Lys Phe Thr Leu Asn Arg Asn Val Asp Asn Tyr Phe Ala 
195 ' 200 205 

»ag gtt gaa cag etc gec ttt get cct tec cat ctg cct cct gga ate 672 




Glu Val Glu Gin Leu Ala Phe Ala Pro Scr His Leu Pro Pro Gly lie 
210 215 220 

gag ccc teg aac gat ccc gte ett cag get cga eta ttc tec 7 1 4 

Glu Pro Scr Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Scr 
225 230 235 



• 210 9 
■ :21 1= 238 
<212~ PRT 

:21 3> Phaffia rhodozyma 
<40(J: • ( ) 

Ser Gly Ser Scr Asp Thr Ala Arg Asp Pro Arg Gly Phe Ser Leu Lys 
1^5 H) 15 

Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 

20 25 30 

He Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe lie His Thr 
35 40 45 

Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 

Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
05 70 75 80 

Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 90 95 

Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Tip Asn Tyr 
" 100 105 110 

Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 

Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 

Asp Leu Phe Asp Ala He Lys Asn Gly Asp Phe Pro Ser Trp Thr Val 
145 150 155 160 

Gin Val Gin Val Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 



165 170 175 

He Leu Asp Leu Thr Lys Val Tip Ser His Lys Glu Phc Pro Leu Arg 
180 185 1W 

Thr lie Gly Lys Phe Thr Leu Asn Arg Asn Val Asp Asu Tyr Phc Al 
H)5 ' ' 200 205 

Glu Val Glu Gin Leu Ala Phc Ala Pro Ser His Leu Pro Pro Gly lie 
210 215 220 

Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phc Scr 
225 230 235 



• 210 10 

• 211-23 

• 212 DNA 

• 213 • Artificial Sequence 
220 ■ 

223 Description of Artificial Sequence:Sodl (sense 
primer for cloning of SOD genes) 

400 10 

aarcaycayc aracntaygt naa 2 - 



210 > 11 

211 -23 

212 DNA 

213 Artificial Sequence 
220 ■ 

223 ■ Description of Artificial Sequence:Sod4 (antiscnsc 
primer for cloning of SOD genes) 

400 > 11 

gcccanccng anccytgnac ncc — 1 



210 • 12 
-211 >26 

212 DNA 
<213> Artificial Sequence 



"220 • 

- 223 • Description of Artificial Sequencc:Sod 14 (sense 
primer for the construction of SOD1 -disrupting 
plasmid) 

• 400: - 12 

ggtacctccg atgataggaa tgtgag 26 



■ 210 • 13 
•211: 26 

• 212: • DNA 

■ 213 Artificial Sequence 
220 • 

■ 223 Description of Artificial Scquence:Sodl 5 

(antisense primer for the construction of 
SOD 1 -disrupting plasmid) 

400 • 1 3 
gaattcagtt caaeggagga ggacac 



210 14 

• 211 -26 

• 212 DNA 

213 Artificial Sequence 

• 220 • 

223 ■ Description of Artificial Sequence:Sod47 (sense 
primer for the construction of SOD2-disrupting 
plasmid) 

400 • 14 

uaatteggag gaggacacat caaccg - ( 



210 > 15 

211 >26 

212 DNA 

:213 ; Artificial Sequence 



22<> 

223-- Description of Artificial Sequence:Sod48 



• 



(antisense primer for the construction of 
SOD2-disrupting plasmid) 

• 400 ■ 15 

ggtacctgta ctggaggtag aaagcg 26 

210 16 
2 1 1 : • 23 
212 l)\A 

'213^ Artificial Sequence 
220 

'22X- Description of Artificial Sequence:Sod2 (sense 
primer for cloning of CAT gene) 

•400; 16 

mgnttytcna cngtnggngg nga 23 

210 17 
•211 23 
212; DN A 

213; • Artificial Sequence 

• :220; 

• 223: Description of Artificial Sequence:Cat5 (antisense 

primer for cloning of CAT gene) 



• :400> 1 7 

ckrtgnckyt gngtrtcngg rta 



23 



